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Abstract. A new type of gradient estimate is established for diffusion semi- 
groups on non-compact complete Riemannian manifolds. As applications, a 
global Harnack inequality with power and a heat kernel estimate are derived 
for diffusion semigroups on arbitrary complete Riemannian manifolds. 



1. THE MAIN RESULT 

Let M be a non-compact complete Riemannian manifold, and P t be the Dirichlet 
diffusion semigroup generated by L — A + W for some C 2 function V. We intend 
to establish reasonable gradient estimates and Harnack type inequalities for P t . In 
case that Ric — Hessy is bounded below, a dimension-free Harnack inequality was 
established in [15] , which according to p~7) , is indeed equivalent to the corresponding- 
curvature condition. See e.g. [2] for equivalent statements on heat kernel functional 
inequalities; see also [El GUIS] f° r a parabolic Harnack inequality using the dimension- 
curvature condition by shifting time, which goes back to the classical local parabolic 
Harnack inequality of Moser [10] . 

Recently, some sharp gradient estimates have been derived in [13l US] for the 
Dirichlet semigroup on relatively compact domains. More precisely, for V — and 
a relatively compact open C 2 domain D, the Dirichlet heat semigroup P 4 satisfies 

(1.1) \VP t D f\(x)<C(x,t)P t D f{x), x€D,t>0, 

for some locally bounded function C: D x ]0,oo[ — * ]0, oo[ and all / G , the 
space of bounded non-negative measurable functions on M . Obviously, this implies 
the Harnack inequality 



(1.2) P t u f(x) < C(x, yi t)P t u f(y), t > 0, x,y e D, f e 



for some function C : M 2 x ]0, oo[ — > ]0, oof. The purpose of this paper is to establish 
inequalities analogous to (jl.ip and (|1.2[1 globally on the whole manifold M. 

On the other hand however, both (jl.ip and (|1.2[1 are in general wrong for P t 
in place of P/ 3 . A simple counter-example is already the standard heat semigroup 
on R d . Hence, we turn to search for the following slightly weaker version of gradient 
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estimate: 

(1.3) |VP t /(aO| <S(P t f log/- P t f log P t f)(x) +C&$-P t f(x) , 

x e M, t > 0, S > 0, / G 38%, 

for some positive function C : ]0, oo[ x M — > ]0, oo[. This kind of gradient estimate 
is new and, in particular, implies the Harnack inequality with power introduced in 
[T5] (see Theorem O below). 

Theorem 1.1. There exists a continuous positive function F on ]0, 1] x M such 
that 

\VP t f(x)\ < 5(P t f\ogf -P t f log P t f) (x) 

(1-4) + (^Al,,)(^l Ty + l) + ^)p t /(,), 

6 > 0, x e M, t > 0, / g 
Theorem 1.2. There exists a positive function C G C(]l, oo[xM 2 ) smc/i t/ia£ 

(Pt/W) a < (P t / a (»))e Xp |fc^+aa(a >a:> y) ( i } + pfav)) }, 

a > 1, t > 0, x,y e M, f G 

where p is the Riemannian distance on M . Consequently, for any 8 > 2 there exists 
a positive function C$ G C([0,oo[ x M) such that the transition density p t (x,y) of 
P t with respect to /i(dx) := e v ^ x 'dx, where dx is the volume measure, satisfies 

{ x ^ exp {- P (x, y) 2 /(2St) + C s (t, x) + C s (t, y)} 
Pt{x,y) < = -, x,y G M, t G J0,1[. 

<J»(B(x,V2i))»(B(y,V2i)) 

Remark 1.1. According to the Varadhan asymptotic formula for short time behav- 
ior, one has lim^o 4tlogpt(x, y) = —p(x,y) 2 , x ^ y. Hence, the above heat kernel 
upper bound is sharp for short time. 

The paper is organized as follows: In Section [2] we provide a formula expressing 
Pt in terms of Pf and the joint distribution of (r, X T ), where X t is the L-diffusion 
process and r its hitting time to dD. Some necessary lemmas and technical re- 
sults are collected. Proposition 12.51 is a refinement of a result in [19] to make the 
coefficient of p{x,y)/t sharp and explicit. In Section [3] we use parallel coupling of 
diffusions together with Girsanov transformation to obtain a gradient estimate for 
Dirichlet heat semigroup. Finally, complete proofs of Theorems 11.11 and 11.21 are 
presented in Section [5] 

To prove the indicated theorems, besides stochastic arguments, we make use of 
a local gradient estimate obtained in [T3] for V = 0. For the convenience of the 
reader, we include a brief proof for the case with drift in the Appendix. 

2. Some Preparations 

Let X s (x) be an L-diffusion process with starting point x and explosion time 
£(x). For any open C 2 domain D C M such that x G D, let t(x) be the first 
hitting time of X s (x) at the boundary dD. We have 

P t f(x) = E [f(X t (x)) l {t<ax )}] , P D f(x) - E [f(X t (x)) l { t<r( X ) } ] ■ 
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Let pP(x, y) be the transition density of Pf with respect to \x. 

We first provide a formula for the density h x {t 1 z) of (t(x), X t ( x ^ (x)) with respect 
to dt ® v{dz), where v is the measure on dD induced by n(dy) :— e v ^ v 'dy. 

Lemma 2.1. Let K(z,x) be the Poisson kernel in D with respect to v. Then 

(2.1) MM)= / {-d t pf(x,y))K(z,y)n(dy). 

Jd 

Consequently, the density s t— > t x (s) of t(x) satisfies the equation: 

(2-2) 4(«)= / (-9a£(x,yj) fx(dy). 

Jd 

Proof. Every bounded continuous function / : dD — > K extends continuously to a 
function h on D which is harmonic in D and represented by 



h(x) = / K(z,x)f(z)u(dz). 

JdD 

Recall that z i— > K(z,x) is the density of X T ( x )(a;). Hence 



E[/(X r(je) (aO)] = /j(.t) = / i/(dz). 

On the other hand, the identity 

h(x) = E[h(X tAT(x) )\ 

yields 

h(x)= / Pt{x, y)h(y) n(dy) + / i/(d^) / h x (s,z)f(z)ds 
Jd Jod Jo 

= I P?{x,y)[ I K(z,y)f(zMdz)\ M (dy)+ / / ftx(«,«)/(z)d« 

Jr> wai) / Jod 

rt 

-,Di 



f(z)[ / p t (x,y)K(z,y)n(dy) + / ft I (s,2)ds h/(dz), 
3D \Jd Jo / 

which implies that 

(2.3) K(z,x)= pf{x,y)K{z 1 y) fi{dy) + h x (s,z)ds. 

Jd Jo 

Differentiating with respect to t gives 

(2.4) h x (t, z) = -d t f p?(x, y)K(z, y) n{dy). 

Jd 

Since dtpf{x : y) is bounded on [e, e -1 ] x D x D for any e € ]0, 1[ , Eq. (|2.ip follows 
by the dominated convergence. 

Finally, Eq. (|2.2[) is obtained by integrating (|2.1j) with respect to ^(dz). □ 

Lemma 2.2. TTie following formula holds: 

Ptf(x) = i^/fr) + / P t - S f{z)h x {s, z) dsv{dz) 

J]0,t]xdD 

= P t D f(x)+ f P t - s f{z)PR 2 h.{s/2,z){x)dsv{dz). 

J]0,t]xdD 
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Proof. By the strong Markov property we have 
P t f(x)=E[f(X t (x))l {t<6(x)} ] 
= E[f(X t (x))l {t <t(x)}\ + E[f(X t (x))l {T(x) 
(2.5) = P Df( x ) +E \e [f(X t (x))l Mx)<t< ^ x)} \(r(x),X T{x) (x))] 



= P t n f(x) + / P t . s f{z)h x {s,z)dsv{dz). 

J}0,t]xdD 

Next, since 

d sP ° (x,y)=Lpf(.,y)(x) = LP s D /2 pf /2 (.,y)(x) 

= P?/2(Lp? /2 (-,y))(x) = P° 2 (d u pZ(-,y)\ u=s/2 )(x), 
it follows from (|2.ip that 

(2.6) h x {s,z) = P s D /2 h.(s/2,z){x). 

This completes the proof. □ 



We remark that formula (|2.6[) can also be derived from the strong Markov prop- 
erty without invoking Eq. (|2.1[) . Indeed, for any u < s and any measurable set 
A C dD, the strong Markov property implies that 

P{t{x)>s, X t(x) (x) E A} =e\(1 {u<t{x)} P {t(x) > s, X t{x) {x) G A\& u ) 

p%(x,y)F{T(y) > s-u, X T[y) {y) e A) fi(dy), 

J D 

and thus, 

h x (s,z) = P,®h.(s - u,z)(x), s>u>0, x £ D, z&dD. 

Lemma 2.3. Let D be a relatively compact open domain and pgr) be the Riemann- 
ian distance to the boundary dD. Then there exists a constant C > depending on 
D such that 

P{t(x) <t}< C*c- p ^ (a:)/16t , x e D, t>0. 

Proof. For x £ D, let R :— pod{x) and p x the Riemannian distance function to x. 
Since D is relatively compact, there exists a constant c > such that Lp x < c holds 
on D outside the cut-locus of x. Let j t '■= Px(Xt(x)), t > 0. By Ito's formula, 
according to Kendall [7], there exists a one-dimensional Brownian motion 6 t such 
that 

d~/ 2 < 2V2-/ t db t + cdt, t< t(x). 
Thus, for fixed t > and <5 > 0, 

K2 



Z s := exp ^- 7s 2 - -cs ~ 4— J 7 2 duJ , s < t(x) 
is a supermartingale. Therefore, 
P{t(x) <t} = P | max 1sAt[x) > r\ < P | max Z sAt[x) > c ^ /t-Sc-^R^/t^ 

< exp fc6 - i(<5i? 2 - 4S 2 R 2 )j . 
The proof is completed by taking S := 1/8. □ 
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Lemma 2.4. On a measurable space (E, JF, p) satisfying p.(E) < oo, let f G L 1 (p) 
be non-negative with p(f) > 0. Then for every measurable function ip such that 
ipf G L l (p), there holds: 



(2.7) 



/ M d£ < [ f log ^777 d/i + p(f) log / d/i. 



Proof. This is a direct consequence of |12) Lemma 6.45. We give a proof for com- 
pleteness. Multiplying / by a positive constant, we can assume that p(f) = 1. If 
J E dp — oo , then (|2.7p is clearly satisfied. 

If Jg d/i < oo, then since J E e^ d/i > Jy >0 j d/i, we can assume that / > 
everywhere. Now from the fact that e* 1 j € L 1 (//i) , we can apply Jensen's inequality 
to obtain 

log Qf d/i) = log Qf /d/i) > jf log (e*j) /d/i 
(note the right-hand-side belongs to RU {— oo}). To finish we remark that since 

logfe^ W = I ipfdfi- [ /log /d/i. 
Je J e 



f 



□ 



Finally, in order to obtain precise gradient estimate of the type (|1.4|) . where 
the constant in front of p{x,y)/t is explicit and sharp, we establish the following 
revision of pjH Theorem 2.1]. 

Proposition 2.5. Let D be a relatively compact open C 2 domain in M and K a 
compact subset of D. For any e > 0, there exists a constant C(e) > such that 

| V1 „ gP f (.,,) W | < c <^ + '"> + &±iMm>, 

(2.8) t e ]0,1[, xeK, yeD. 

In addition, if D is convex, the above estimate holds for e — and some constant 

C(0) > o. 

Proof. Since 5 := min^ pqd > 0, it suffices to deal with the case where < t < 1 /\S. 
To this end, we combine the argument in [19j with relevant results from [161 118) . 

(a) Let to = t/2 and y £ D be fixed. Take 

f(x, s) = pf +tQ (x, y), x E D, s > 0. 
Applying Theorem 15.11 of the Appendix to the cube 

Q := B(x,p dD {x)) x [s - p dD (x) 2 /2,s] C Dx [-t ,t ], s < t , 
we obtain 

(2.9) |Vlog/(x,*)| < — (l + log-^- ), s<t , 

where A := supg / and cq > is a constant depending on the dimension and 
curvature on D. By [5} Theorem 5.2], 

(2.10) A < af (x, s + PdD {xf) , s e ]0, 1], x e D, 
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holds for some constant c\ > depending on D and L. Moreover, by the boundary 
Harnack inequality of [4j (which treats Z = but generalizes easily to non-zero C 1 
drift Z), 

(2.11) f{x,s + p aD {xf) < c 2 f(x,s), S6]0,l],i6fl, 

for some constant C2 > depending on D and L. Combining (|2.9|) . (|2.10p and 
p. 11| . there exists a constant c > depending on D and L such that 

(2.12) |Vlog/(a:,fl)| < x£D, se]0,t ] with PdD (x) 2 <s. 
(b) Let 

Cl={(x,s): xeD, se [0,t ], p d D(x) 2 > s} 
and B = sup^ /. Since d s f = Lf, for any constant b > 1, we have 



Next, again by d s f = Lf and the Bochner-Weizenbock formula, 

(t _a.)^E>- 2 ^, 

where fc > is such that Ric — VZ > — k on D. Then the function 



-/log-r 



(l + 2fc S )/ / 
satisfies 

(2.13) (L-d s )h>0 onL>x]0,oo[. 

Obviously /i(-,0) < 0, and (|2.12p yields h(x,s) < for s — pod{x) 2 provided the 
constant b is large enough. Then the maximum principle and inequality (|2.13|) 
imply h < on f2. Thus, 

bB 

(2.14) |V log f(x, s)\ 2 < (2k + s- 1 )\og—, (x,s)en. 

(c) If D is convex, by [H Theorem 2.1] with 5 = V* and t = 2t , we obtain 
(note the generator therein is ^L) 

f(x,t )= P2t (x,y)= pg (y, x) > cMv) t~ d/2 e-^ 2 / st ° , x e K, y e D 

for some constant c\ > 0, where tp > is the first Dirichlct cigenfunction of L on D. 
On the other hand, the intrinsic ultracontractivity for Pf implies (see e.g. |llj ) 

f(z,s) ^pf +to (z,y) < c 2 (p(y)tg {d+2)/2 , z,y£D, s<t Q , 

for some constant C2 > depending on D, K and L. Combining these estimates 
we obtain 

< c 3 ^V^) 2 / 8t °, xtK, s<t , 

for some constant C3 > depending on D, K and L. Hence by (|2. 14[) for s = to we 
get the existence of a constant C > such that 

iVlogp^.,^! 2 < (t^+2k) fc + log^ + ^l^ 



GRADIENT ESTIMATE AND HARNACK INEQUALITY 



7 



for &l\ y E D, x E K and to E ]0, 1[ with to < pdD(x) 2 . This completes the proof 
by noting that t = 2to- 

(d) Finally, if D is not convex, then there exists a constant a > such that 

(V N X, X) > -v\X\ 2 , X e TdD, 

where N is the outward unit normal vector field of 3D. Let / 6 C°°(D) such 
that / = 1 for pqb > e, 1 < / < c 2e<T for p dD < e, and N\ogf\ 9D > a. By 
Lemma 2.1 in [T5], d-D is convex under the metric g :— f~ 2 (-, ■). Let A, V and p 
be respectively the Laplacian, the gradient and the Riemannian distance induced 
by g. By Lemma 2.2 in [15] , 



L := A + = /" 



W. 



A + (d-2)/V/ 

Since Z) is convex under g, as explained in the first paragraph in Section 2 of [18j . 

g{Vp(y, -),V^)| 9D < 0, 

so that 

a(y) := sup5(Vp(y, ■), V<p) < oo, y E D. 

D 

Hence, repeating the proof of Theorem 2.1 in |16j . but using p and V in place of 
p and V respectively, and taking into account that / — > 1 uniformly as e — > 0, we 
obtain 

P 2 D t0 (x,y) > C 1 ( £ )^(y)t - d/2 e- C3 ^« a ^) 3 /8to 

>C x {e)<p{y)tf ,2 e- c ^ c ^ x >rf' %t ° 

for some constants Ci(e), C2(e), ^(e) > 1 with C^fe), C3(e) — > 1 as e — > 0. Hence 
the proof is completed. □ 



3. Gradient estimate for Dirichlet heat semigroup using coupling of 

diffusion processes 

Proposition 3.1. Let D be a relatively compact C 2 domain in M. For every 
compact subset K of D, there exists a constant C — C[K, D) > such that for all 
5>0,t>0,xoEK and for all bounded positive functions f on M , 

|V-P, D /(w)| 

<3 '" s sp,D ( /iog (to)) + c (w^-> + p < DfM - 

Proof. We assume that t E]0, 1[, the other case will be treated at the very end of 
the proof. 

We write VV — Z so that L — A+Z. Since Pf only depends on the Riemannian 
metric and the vector field Z on the domain D, by modifying the metric and Z 
outside of D we may assume that Ric — \7Z is bounded below (see e.g. [H]); that 
is, 

(3.2) Ric-VZ>-K 
for some constant k > 0. 
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Fix xq G K. Let / be a positive bounded function on M and X s a diffusion with 
generator L, starting at xq- For fixed £ < 1, let 

_ VP t D f(x Q ) 
V \VPt D fM\ 

and denote by u i— > the geodesies in M satisfying <p(0) = v. Then 
d 



dit 



P/ , /( V («)) = |VJV/(a:o)|. 

To formulate the coupling used in [T], we introduce some notations. 

If Y is a semimartingale in M, we denote by dY its Ito differential and by d m Y 
the martingale part of dY: in local coordinates, 

i_ „„ , rk ^ _d_ 

dx % 



dY = ay* + -r; fc (y) d(r J , r fc 



where r*- fe are the Christoffel symbols of the Levi-Civita connection; if dY 1 = 
dM l + dA 1 where M 1 is a local martingale and A 1 a finite variation process, then 

d m y = dM i A. 

Alternatively, if Q(Y) : Ty M — > TyM is the parallel translation along Y, then 

dY t = Q{Y) t d(j^ Q{Y)- l odY s 
and 

d m y t -Q(y) t div t 

where iV t is the martingale part of the Stratonovich integral L Q(Y)j 1 o dY s . 

For x, y G M, and y not in the cut-locus of x, let 
(3.3) 

d-l pp(x,y) 

I i x ,y) = y2 / {\^e(x,y)Ji? + {R{e{x 7 y), J i )J l + V &{x , y) Z,e(x,y))) a ds 

where e(x,y) is the tangent vector of the unit speed minimal geodesic e(x,y) and 
{Ji)i=i are Jacobi fields along e(x,y) which together with e(x,y) constitute an 
orthonormal basis of the tangent space at x and y: 

Ji(p(x, y)) = Px.yMO), i = l,...,d-l; 

here P x , y - T X M — > T y M is the parallel translation along the geodesic e(x,y). 

Let c G ]0, 1[. For h > but smaller than the injectivity radius of D, and £ > 0, 
let X h be the semimartingale satisfying Xq = ip(h) and 

(3.4) dX' s l = l\ \ d,,, A . + Z(X s h ) ds + ^ds, 

where 

e- (~ + «&) 

with n(Xg, X s ) the derivative at time of the unit speed geodesic from to X s , 
and P Xs x h '■ Px a M — > T x hM the parallel transport along the minimal geodesic 
from X s to Xg. By convention, we put n(x, x) = and P X)X = Id for all x G M. 
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By the second variational formula and ()3.2p (cf. [I]), we have 

dp{X s , X*) < il{X s , X*) - A _ K/l | ds < -A ds, s < T h , 

where 2\ := inf{s > : X s = X!}}. Thus, (X s , X% ) never reaches the cut-locus. In 
particular, Th < ct and 



(3.5) 



X, = XL 



Moreover, we have p(X s ,X%) < h and 



(3.6) 



\CV<h'[K+f t ) 



s > ct. 



1 \2 



We want to compensate the additional drift of X h by a change of probability. To 
this end, let 



and 



itf = exp (M*-±[M h ]?j 



Clearly R h is a martingale, and under Q h = R h ■ P, the process X h is a diffusion 
with generator L. 

Letting t(xq) (resp. r h ) be the hitting time of 3D by X (resp. by X h ), we have 

l{t<r h } < l{t<r(K )} + l{T(x )<t<"r'*}' 

But, since = X s for s > ct, we obtain 

l{r(a;o)<t<T' 1 } = l{r(a; )<ei} l{t<r''} ■ 

Consequently, 

~ {P t D f(<p(h)) ~ Pt D f( x o)) =j i E[f(X?)R> t l l {t<Th} - /(X t (0))l {t<T(xo)} ] 

- ^ E [•/'(X 4 ' l )^ i l{t<r(x n )} - /(X t (0))l{ t<T(:£ . )}] 
+ I" E [/(X t ' l )-R t ' l l{r(xo)<ct}l{t<r fc }] ) 



and since X/ 1 = X t this yields 
1 

(3.7) h 



(P t D fMh)) ~ PFfM) < E 



f( X t)l{t<T(x )}-^(Rt - 1) 

- T E [/(X t ' l )i?t l l{ T ( a;o )< ct }l{ t < r fc}] • 



The left hand side converges to the quantity to be evaluated as h goes to 0. 
Hence, it is enough to find appropriate lim sup's for the two terms of the right hand 
side. We begin with the first term. Letting 
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and noting that (n(X^ , X r ), P Xrt x h( ^mX r ) = >/2db r up to the coupling time Th 
for some one-dimensional Brownian motion b r , we have 



itf = exp ( M t h - -[M% ) < 1 + ~ -[M% + (Y t h ) 2 exp(Y t h ) 



= 1 + M* / \^\ 2 ds + (Y t h ) 2 exp(^). 



From the assumptions, exp(Y t h ) and Y t h /h have all their moments bounded, uni- 
formly in h > 0. Consequently, since / is bounded, 



lim sup I 

fi->0 

which implies 



lim sup E 



/(X t )W(* )}~ [I \£\ 2 dr+(Y t h feMY t h ] 



= 0, 



< lim sup E 
Using Lemma [53] and estimate 



f( x t)l{t<T(xo)}-^(R't ~ 1) 

1 f 

f{X t )l{t<T(x )}T / (£r ) Px r ,X>? d m X r ) 



we have for S > 



E 



/(^t)l{t<T(x )}"T / (€r i P X T ,X!?d-mXr) 



<SP t D [flo, 



f 



5P t D f(x )lagE 
( f 



<6P t D [f\o, 



\P t D f(xo 



l{t<r(x )} exp 
(no) 



1 

Sh Jo 



SP t D f (x )logE 



exp 



/ 1 



V<5 2 /i 2 J 



<5P t D [f\og 



<5P t D [ / log 



( f 



\P t D fM 
f 

P t D f(xo) 



ct 



( Xo ) + 8Pff{x )^ + k 



1 



5 2 \c 2 t 2 



y^{x )+^-p t D f{x ), 



where C' = 1 + (ck) 2 (recall that t < 1). Since the last expression is independent 
of ft, this proves that 

1 



lim sup E 



/(^t) 1 {t<r(o:o)}^(^t - 1) 



(3.8) 



<£Ff l/lo. 



/ 



P t D f(?o) 



{xo) + ^l PtDf{xo) - 



We are now going to estimate lim sup of the second term in (13. 7|) . By the strong 
Markov property, we have 

E [f( X t )Rt 1 {r(x )<ct}l{t<T''}] = E Q h [^ , t D ct/(^ct) 1 {r(2;o)<ct<T'>}] 

(3-9) <||F ( D ct /|| 00 Q' J {r(^)<ci<r' 1 }. 
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Since p(Xg, X s ) < h Ct S for s G [0, ct], we have on {r(x ) < ct < r h }: 



Pbd(X? (xo) ) < h 
For s e [0, r h - t(x )\, define 



h , ct-rjxo) 
i*°)>- h ct ■ 



Y'=p(X\ Xo)+s ,dD), 

and for fixed small e > (but e > ft), let S" = inf{s > 0, Y s ' = e or F s ' = 0}. Since 
under Q' 1 the process X^ is generated by L, the drift of p(X£,dD) is Lp(-,dD) 
which is bounded in a neighborhood of dD. Thus, for a sufficiently small e > 0, 
there exists a Q h -Brownian motion [3 started at 0, and a constant N > such that 

y s := ft ct - T } x °^ + y^g. + Ns > y;, se[0,5']. 
ct 

Let 

5 = inf {u > 0, y„ = e or y„ = 0}. 
Taking into account that on {t(xq) = u}, 

{Y' s , = s} U {S' > ct - u} c {T s = e} U {S 1 > ct - u}, 
we have for u £ [0, ct], 

Q h {ct < t^tM = u} < Q h {Y s , = e\t(x ) = u) + Q h {S' > ct - u\t{x q ) = u) 

< Q h {Y S = s\t(x ) = u) + Q h {S >ct- u\t(x q ) = u} 

< Q h {Y s = e\r(x ) = u} + [S\t(x ) = u] . 
Now using the fact that e~ NYs is a martingale and Y 2 — 2s a submartingale, we get 

Q h {Y S - e|r(a:o) =u}= 1 - & _ Ne < &h 

and 

E qh [S\t(x ) =u]< E qh \yI\t{x ) = u] 

<e 2 Q h {Y s =e\r(x ) = u} 

2 l-e- Nh ^ _ h(ct-u) 

for some constants C\,Ci > 0. Thus, 

Q h {ct < r h \r(x ) = u} < &h + C 2 1 1^L_A 

^ ' ct — u ct 

< Cl h + C 3 -<C 4 ^ 
ct t 
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for some constants 63,64 > (recall that t < 1). Denoting by £ h the density of 
r(xo) under Q h , this implies 

f-Ct 

Q h {T(x )<ct<T h } = / e h (u)Q h {ct <T h \a h =u}du 
Jo 

< C 4 - / £ h (u)du 
t Jo 

= C^q h {r(x )<ct}. 

In terms of D~ h = {x € D, p aD (x) > h} and o h = inf{s > 0, X* e dD-' 1 }, we 
have a h < t(xq) a.s. Hence, by Lemma \2. 31 

Q h {r(x ) < ct} < Q*{o* < ct} < Ccxp j- ^-^l j , 

where we used that is generated by L under Q h . This implies 

(3.10) Q h {r(x ) <ct<r h }< C,\ exp {-^^j ■ 

Since - (P t D (ip(h)) - P t D (x )) converges to |VP t D /(a;o)|, we obtain from (|3Jf . 
dnU, (EH) and (l3~TUD . 

|Vif /(*o)| < SP t D (flog (*—)) (x ) 



(3.11) + ^-P t D f(xo) + C 5 \\P t D ct f\\J- exp t <' M 



a5i " v "' J " I - c " 1100 1 r i 16ct 

Finally, as explained in steps c) and d) of the proof of Proposition 12.51 for any 
compact set K C D, there exists a constant C(K, D) > such that 

\\PF- c J\\ o<^ c{K - D)/t PFf^o), ce [0,1/2], .oeif, te]0,i]. 

Combining this with (|3.11|) . we arrive at 

/ 



|VP t D /(x )| <<5P t D (/log 

(3.12) 



*V7(*o) 



)(x ) + ^Pff(x ) 



L J / P9flW\ f C(K,D) \ pD „, 
Finally, choosing c such that 

1 dist( K,dD) 

0<C< 2 A TecWf' 

we get for some constant C > 0, 

(3.13) |Vif /(» )| < SP t D (/log ( pO L 0) )) (*o) + C (J- t + l) P t D /(x ), 

x € K, 5 > 0, 

which implies the desired inequality. 
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To finish we consider the case t > 1. From the semigroup property, we have 
P t D f = Pf(P£if). So letting g = P t D 1 f and applying (I3T3]) to g at time 1, we 
obtain 

\VP t D f(x )\ < 8P? (g log ( pcLx )) ^ + C Q + l ) P i D 9M- 

Now using P^g = Pff, we get 

|Vif/(*o)| < ^(fllogffjfxo) - Pt D f( x o) ^gP t D f(x ) +c(± + l\ P t D f(x ). 

Letting y(x) = xlogx, we have for z *E D 

glogg(z) = <p (E [/(X t _i(«))l {t _ 1<rW} ]) 

<E[vj(/(jrt-i(«))i { t-i<rW})] 

= E[^(/)(X t _ 1 ( 2 ))l {t _ 1<r(z)} ] 
= P£i(/log/)(z), 

where we successively used the convexity of ip and the fact that ip(0) = 0. This 
implies 

iDtt„\\ /rnD|fi„J / \ \ („ \ , n I 1 i 1 1 D-D 



|VP^/(x )| < «T (/ lo S I p5^ J J (*<>) + c U + lj i?7(*o), 
which is the desired inequality for f > 1. □ 

4. Proof of Theorems 11.11 and Theorem 11.21 

of Theorem ] 1.1[ We assume that t e]0, 1[ and refer to the end of the proof of 
Proposition 13 . 1 1 for the case t > 1. Fixing S > and xo S M, we take i? = 160/(5 A 
1). Let D be a relatively compact open domain with C 2 boundary containing 
B(xq, 2R) and contained in B(xq, 2R + e) for some small e > 0. By the countable 
compactness of M, it suffices to prove that there exists a constant C = C(D) 
such that (jl.4p holds on B(xq,R) with C in place of P(<5 A l,Xo)- We now fix 
x e B(x ,R), t e ]0, 1] and / € Without loss of generality, we may and will 

assume that Ptf(x) = 1. 

(a) Let P s (x,dy) be the transition kernel of the L-diffusion process, and for 
xeD.ze M, let 



v a (x,dz) = / h x (s/2,y)P t - a (y,dz)u(dy), 

JdD 

where v is the measure on 3D induced by n(dy) — e v ( y >dy. By Lemma [2~2l we have 
P t f(x) = P t D f(x) + f p° 2 (x,y)f(z)ds»(dy)v s (y,dz). 



Then 

|VP t /(x)| < |VP t D /(x)| 



l]0,t]xDxM 

(4.1) =: h+I 2 . 



]0,t]xDxM 



I V logpf /2 (- , y)(x)\pf /2 (x, y)f(z) ds^{dy)v s (y, dz) 
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(b) By Proposition 13. l\ we have 

(4.2) h <SP t D (f log f){x) + ^ + c(^ + lj , x£B(x ,R), te]0,l[, 6>0 

for some C = C(D) > 0. 

(c) By Proposition 12.51 with e — 1, we have 

-Clogte + s" 1 ) 2p(x,y) 



(4.3) J 2 < 



]0,(]xMxD 



Ps/2( x , V) /O) ds "s(y, dz)fj,{dy) 



for some C = C(D) > and all t S ]0, 1]. Applying Lemma [2.41 to the measure 
p, := pf, 2 (x, y) ds v s (y, dz)n(dy) on E := ]0, (] X M X so that 



= P(r(a;) < t < £(z)) < 1, 



we obtain 



h < 8E [(f\ogf)(X t (x))l MxHt<a , x)} ] + - + 5E [/(X t (a;))l {T ( x) < t<c(B)} ] 



x lop 



exp 



]0,t]xMx.D 



fClog(e + s x ) 2p(x,y)\ D , . , . . 
| + — ^ — f As Ps/2( x iy) v s{yAz)n{dy) 



< SE [(flogf)(X t (x))l {T{x) < t<i{x)} ] + -+SE [f(X t (x))l {Tix) < t<ax)} ] 



(4.4) 



where 



We get 



x log 



]0,i]xMxO 



exp {j + sf } ds p^/^ x ->y) Vs< <y-' Az ) ^ d y)-< 



A := sup {C-\Arlog(e + r) — r} < oo. 

r>0 



/a < 5E [(/log/)(Xt(x))l {T{x) <t< c(lB )}] + - 

+ <5E [f(X t (x))l {T{x) < t<a , x)} ] ( logE[exp (9i?/Jr(x)) ] 



o 

<5E [(/log/)(X t (x))l {T(x) < t<?(x)} ] + - + SlogE[exp(9R/Sr(x))] + A 
< SE [(f\ogf)(X t (x))l {T{x) < t<a . x)} ] 



51ogE 



exp 



9R 



6 All 



(4.5) 



(5 A 1)t(x) 

<SE [(/Iog/)(X t (x))l {T(x) < t<c(lB)} ] 

9i? 



A- 



(<5 A 1) logE 



exp 



(S A l)r(x) 



A 
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By Lemma 12.31 and noting that po(x) > R, we have 



E 



exp 

= 1 - 
= 1 - 
< 1 - 
= 1 - 



{5 A l)r(x) 
°° 9Rs 



< 1 +E 
9Rs 



9R 



o (<5A1) 
9R 



exp 



(5 A i)t(» 
d 



exp 



9R 



(5 A 1)7 ds 



9R 



(SA1)J \(SA1) 



1 exp 



(<5A1)t(x) 
(-P{t(x) < s- 1 }) ds 
9Rs 



9R 



(SA1)J \(5A1) 



91? 



-s + 1 exp 



9R 



9R 



1 exp 



(SA1) 

9Rs 

JSAl) 
-Rs 



{t(x) < s^jds 
PV 



exp 
ds 



1G 



ds 



(SA1)J \(SA1) 

= 1 + 9 (9u + 1) exp (-u) du =: A 1 , 
Jo 

since R= 160/ (5 A 1). This along with (j4~5]) yields 

(4.6) 7 2 < <5E [(/log/)(Jft(a:))l {T(!ll) < t<fi{x)} ] + log A' + A + - . 

The proof is completed by combining with (HTTjl . P~2")) and 
o/ Theorem ] l.SX By Theorem ll.il 

|VP t /(x)| < <5(P t (/log/)(x) - (P t /)(x) log P t /(x)) 



□ 



(4.7) 



F(<5 A l,x) 



<5(t A 1) 



+ 1 ) + ^ ) P t /(x), <S > 0, x e M. 



For a > 1 and x 7^ y, let /?(s) = 1 + s(a — 1) and let 7 : [0, 1] — > M be the minimal 
geodesic from x to y. Then (7! = p(x,y). Applying (|4.7[) with S 



obtain 
d 
ds" 



ap(x,y) 



, we 



log(P t /"W) 



(is) 



«(a-l) Pt(/"W log [P t fPW) log PtfW 



(3(s) 2 Ptf f3{s 



(is) 



> 



(3(s) PtffW 
ap(x,y) \ a-1 



(7s) 



(3(s)P t f^)( ls )\ap(x,y) 
- \VP t f^(ls)\ 

a-1 



(p t (f {s hogf^) - (P f {s) ) log p f f^)( ls ) 



> -F 



ap(x,y) 



A 1,7, 



2 p 2 (x,y) 



> -C(a,x,y) 



ap 2 {x,y) 
(a- l)(t A 1) 



f3(s)(a — l)(t A 1) /?(s) 
2(o-l) 



p(x,y)^ 2(a-l) 
e/3(s) 



+ p(x,y) - 



where C(a,x,y) := sup se 
nack inequality. 



[0,1] a \ ap(x,y) 



A 1,7s). This implies the desired Har- 
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Next, for fixed a G]l, 2[, let 

K(a, t, x) = sup {C(a, x, y) : y G B(x, V2t)}, t > 0, x G M. 
Note if (a, £, x) is finite and continuous in (a, i, x) G]l, 2[x]0, 1[xM. Let p := 2/a. 



For fixed t G]0, 1[, the Harnack inequality gives for y £ B(x, \/2t), 

(Ptf(x)) 2 <(P t f a (y)rexpl [ ^-^+2K(a,t,x) (J^ + V^j } . 
Then choosing T > t such that q ;= p/2(p - 1) < T/i, 

/i(g(x, V2i)) exp |- 2(2 ~ p) - 2K(a, t, x) (J^ + V2t\ - jT=^} (^/fr)) 2 

< / (^/ a (y)) p exp(- ^' y)2 j^dy). 



Similarly to the proof of [U Corollary 3], we obtain that for any S > 2, choosing 
a = G]l,2[ such that S > -^-^ = > 2, there is a constant c(S) > such 
that the following estimate holds: 



£,(.,-./) : = ^M^expj^^ul,, 



exp{cfflA>,^)(l + V2l)} 

< = , t > 0, x G iw. 

n(B(x,V2l) 

By [51 Eq. (3.4)], this implies the desired heat kernel upper bound for Cg(t,x) := 
c(£)#(a,t,a;)(l + \f2t). □ 

5. Appendix 

The aim of the Appendix is to explain that the arguments in Souplet-Zhang [T3] 
and Zhang [19] for gradient estimates of solutions to heat equations work as well 
in the case with drift. 

Theorem 5.1. Let L = A+Z for a C 1 vector field Z . Fixxq 6 M and i?, T, t > 
such that B(xq,R) C M. Assume that 

(5.1) Ric - VZ > -K 

on B(xq,R)- There exists a constant c depending only on d, the dimension of the 
manifold, such that for any positive solution u of 

(5.2) d t u = Lu 
on Qr^t '■= B(xq,R) x [i — T,to], the estimate 

< c(i + T-^ + VK)(l + log 



|Vlogu| <c^-+T- 1 ' 2 + VK)[l + log 
holds on Qr/2,t/2- 

Proof. Without loss of generality, let N := supg T R u = 1; otherwise replace u by 
u/N. Let / = log it and lu = )-^/J\» ■ By (|5.2p we have 

Lf+\Vf\ 2 ~d t f = 
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SO that 

d t u> 

(5.3) 



Moreover, 
(5.4) 



2(V/,V9t/) 2[V/| 2 9 t / 

(I-/) 2 (I-/) 3 
2(V/,V(£/+|V/| 2 )) 2|V/| 2 (£/ + |V/| 2 ) 

(1 - /) 2 (1 - /) 3 

2(V/,V(A/+|V/| 2 )) 2|V/| 2 (A/+|V/| 2 ) 

(I-/) 2 (I-/) 3 
2(V v/ Z,V/) + 2He S s / (V/,Z) 2 | V/| 2 (Z, V/) 

(I-/) 2 (I-/) 3 
(Z,V|/| 2 ) , 2|V/| 2 (Z,V/) 



= Au 



Au - 



(I-/) 2 (I-/) 3 
2Hes S/ (V/,Z) 2|V/| 2 (Z,V/) 



(I-/) 2 (I-/) 3 ' 

Finally, by the proof of [H (2.9)] with -k replaced by Ric(V/, V/)/| V/| 2 , we 
obtain 



Aw 

(5.5) 



2(V/,V(A/ + |V/| 2 )} , 2|V/| 2 (A/ + |V/| 2 



(I-/) 2 (I-/) 3 

2/ , , , x ,,2 2wRic(V/,V/) 
^ ^4<V/,Vw)+2(l-/V 2 + - ' 



' ' |v/| 2 

Combining (JOJ, (jO)) . (j5l| and (|53|) . we arrive at 

- a t cj > -^-j (V/, Vw) + 2(1 - / )w 2 - 2Ku. 

This implies the desired estimate by the Li-Yau cut-off argument as in [13]; the 
only difference is, using the notation in 13J, in the calculation of — (Aip)ui after 
Eq. (2.13) in [13]. By (|5.ip and the generalized Laplacian comparison theorem (see 
[31 Theorem 4.2]), we have 

d 



Lr < VKdcoth {^JK/dr) < - + VKd, 
and then 

-(Lip)u> = -(d^j + (d r ip)Lr)uj < (\d r ^\ 2 + \d r i/j\ * + y/Wd \d r i(j\\ u. 

The remainder of the proof is the same as in the proof of [13[ Theorem 1.1], using 
Lij) in place of Aip. □ 
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